Effective lexical tree search for large vocabulary continuous speech recognition
نویسندگان
چکیده
In this paper, we present an e cient calculation of the factored LM probabilities for speeding up the large vocabulary continuous speech recognition. We introduced a novel technique based on the independent calculation of the factored LM probability. The basic idea of the proposed method is that each factored LM probability is calculated on-demand for a new combination of a previous word hypothesis and a LM look-ahead tree node, instead of calculating all the factored LM probabilities over the tree at a time. The speaker-independent continuous speech recognition experiment was performed for 20 speakers on a 60k word newspaper dictation task. As a result, the proposed method achieved 25% improvement in speed.
منابع مشابه
Speech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon Language Model
This paper gives an overview of an architecture and search organization for large vocabulary, continuous speech recognition (LVCSR at RWTH). In the rst part of the paper, we describe the principle and architecture of a LVCSR system. In particular, the issues of modeling and search for phoneme based recognition are discussed. In the second part, we review the word conditioned lexical tree search...
متن کاملSpeech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon
This paper gives an overview of an architecture and search organization for large vocabulary, continuous speech recognition (LVCSR at RWTH). In the rst part of the paper, we describe the principle and architecture of a LVCSR system. In particular, the issues of modeling and search for phoneme based recognition are discussed. In the second part, we review the word conditioned lexical tree search...
متن کاملSegmental search for continuous speech recognition
The paper illustrates a search strategy for continuous speech recognition based on the recently developed Fast Segmental Viterbi Algorithm (FSVA) [5], a new search strategy particularly e ective for very large vocabulary word recognition. The FSVA search has been extended to deal with continuous speech using a network that merges a general lexical tree and a set of bigram subtrees generated on ...
متن کاملAn efficient lexical tree search for large vocabulary continuous speech recognition
This paper describes an efficient search algorithm for a high speed and high accuracy LVCSR system. A conventionally used lexical tree search is an efficient method, but has a problem in incorporating the language probability. To solve this problem, we propose in this paper a new efficient search algorithm incorporating the language model structure. In our developed LVCSR, 2-pass search algorit...
متن کاملImprovements in search algorithm for large vocabulary continuous speech recognition
Current time-synchronous beam-search algorithm is improved from two aspects for speeding up large vocabulary continuous speech recognition. Single-triphone-tree structure is proposed to take instead of the tree copy technique for simplifying the search computation and saving the memory . By one kind of special-designed token propagation strategy, the n-gram language model can be integrated into...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000